NSF PAR Search | NSF Public Access Repository

Centralization in the Decentralized Web: Challenges and Opportunities in IPFS Data Management

Shi, Ruizhe; Cheng, Ruizhi; Fu, Yuqi; Han, Bo; Cheng, Yue; Chen, Songqing (April 2025, The ACM Web Conference (WWW 2025))

The InterPlanetary File System (IPFS) is a pioneering effort for Web 3.0, well known for its decentralized infrastructure. However, recent studies have shown that IPFS exhibits a high degree of centralization and has integrated centralized components for improved performance. While this change contradicts the core decentralized ethos of IPFS and risks lowering the data replication level and thus availability, it also opens opportunities for better data management and cost savings through deduplication. To explore these challenges and opportunities, we start by collecting an extensive dataset of IPFS internal traffic spanning the last three years, with 20+ billion messages. By analyzing this long-term trace, we obtain a more complete and accurate view of how centralization evolves over an extended period. In particular, our study reveals that (1) IPFS shows a low replication level, with only 2.71% of data files replicated more than 5 times; while increasing replication enhances lookup performance and data availability, it adversely affects downloading throughput due to the overhead involved in managing peer connections; (2) there is a clear growing trend in centralization within IPFS over the last 3 years, with just 5% of peers now hosting over 80% of the content, down significantly from 21.38% 3 years ago, which is largely driven by the increase of cloud nodes; and (3) the default deduplication strategy of IPFS, Fixed-Size Chunking (FSC), is largely inefficient, especially with the default 256KB chunk size, detecting near-zero duplication. Although Content-Defined Chunking (CDC) with smaller chunks could save ∼1.8 petabytes (PB) of storage space, it could impact user performance negatively. We thus design and evaluate a new metadata format that optimizes deduplication without compromising performance.
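The contrast between FSC and CDC in finding (3) can be illustrated with a toy experiment. The sketch below is not the paper's method or the IPFS implementation. It assumes a simplified Gear-style rolling hash for CDC, a 64-byte target chunk size instead of the 256KB default so the demo runs quickly, and random test data. All function names are hypothetical. It shows one commonly cited weakness of FSC: inserting a single byte at the front of a file shifts every fixed-size boundary, so no chunk matches the original, whereas content-defined boundaries realign after a few bytes.

```python
import hashlib
import random

# Assumption: a random 256-entry table, as used by Gear-style rolling hashes.
_table_rng = random.Random(0)
GEAR = [_table_rng.getrandbits(32) for _ in range(256)]


def fixed_size_chunks(data: bytes, size: int = 64) -> list[bytes]:
    """Cut the data every `size` bytes, regardless of content."""
    return [data[i:i + size] for i in range(0, len(data), size)]


def content_defined_chunks(data: bytes, bits: int = 6) -> list[bytes]:
    """Cut after any byte where the top `bits` bits of the rolling hash are zero.

    With a 32-bit hash and a left shift per byte, the hash depends only on
    the most recent 32 bytes, so boundaries follow local content.
    Expected chunk size is about 2**bits bytes.
    """
    chunks, start, h = [], 0, 0
    for i, byte in enumerate(data):
        h = ((h << 1) + GEAR[byte]) & 0xFFFFFFFF
        if h >> (32 - bits) == 0:
            chunks.append(data[start:i + 1])
            start = i + 1
    if start < len(data):
        chunks.append(data[start:])
    return chunks


def shared_fraction(old: list[bytes], new: list[bytes]) -> float:
    """Fraction of chunks in `new` whose digest already appears in `old`."""
    seen = {hashlib.sha256(c).digest() for c in old}
    return sum(hashlib.sha256(c).digest() in seen for c in new) / len(new)


data_rng = random.Random(1)
original = bytes(data_rng.getrandbits(8) for _ in range(8192))
edited = b"\x00" + original  # one byte inserted at the front

fsc_shared = shared_fraction(fixed_size_chunks(original), fixed_size_chunks(edited))
cdc_shared = shared_fraction(content_defined_chunks(original), content_defined_chunks(edited))
print(f"FSC shared chunks: {fsc_shared:.2%}")
print(f"CDC shared chunks: {cdc_shared:.2%}")
```

Production chunkers typically also enforce minimum and maximum chunk sizes, which this sketch omits. Smaller chunks mean more chunks per file and therefore more metadata to resolve, which is consistent with the performance concern raised in the abstract.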
Free, publicly-accessible full text available April 24, 2026
Better Reliability Compression: Model Pruning with Calibrated Uncertainty Estimation for Mobile Deep Learning Applications

https://doi.org/10.1109/MOST65065.2025.00012

Ma, Runyu; Chen, Songqing; Yao, Shuochao (May 2025, IEEE)

Free, publicly-accessible full text available May 4, 2026
A First Look at Immersive Telepresence on Apple Vision Pro

https://doi.org/10.1145/3646547.3689006

Cheng, Ruizhi; Wu, Nan; Varvello, Matteo; Chai, Eugene; Chen, Songqing; Han, Bo (November 2024, ACM)

Full Text Available
MetaFL: Privacy-preserving User Authentication in Virtual Reality with Federated Learning

https://doi.org/10.1145/3666025.3699322

Cheng, Ruizhi; Wu, Yuetong; Kundu, Ashish; Latapie, Hugo; Lee, Myungjin; Chen, Songqing; Han, Bo (November 2024, ACM)

Full Text Available
Dynamic 6-DoF Volumetric Video Generation: Software Toolkit and Dataset

https://doi.org/10.1109/MMSP61759.2024.10743552

Zhu, Mufeng; Sun, Yuan-Chun; Li, Na; Zhou, Jin; Chen, Songqing; Hsu, Cheng-Hsin; Liu, Yao (October 2024, IEEE)

Full Text Available
ALPS: An Adaptive Learning, Priority OS Scheduler for Serverless Functions

Fu, Yuqi; Shi, Ruizhe; Wang, Haoliang; Chen, Songqing; Cheng, Yue (July 2024, Proceedings of the 2024 USENIX Annual Technical Conference)

Full Text Available
ALPS: An Adaptive Learning, Priority OS Scheduler for Serverless Functions

Fu, Yuqi; Shi, Ruizhe; Wang, Haoliang; Chen, Songqing; Cheng, Yue (July 2024, 2024 USENIX Annual Technical Conference (ATC 2024))

FaaS (Function-as-a-Service) workloads feature unique patterns. Serverless functions are ephemeral, highly concurrent, and bursty, with an execution duration ranging from a few milliseconds to a few seconds. These workload behaviors pose new challenges to kernel scheduling. Linux CFS (Completely Fair Scheduler) is workload-oblivious and optimizes long-term fairness via proportional sharing. CFS neglects the short-term demands of CPU time from short-lived serverless functions, severely impacting the performance of short functions. Preemptive shortest job first, also known as shortest remaining processing time (SRPT), prioritizes shorter functions in order to satisfy their short-term demands of CPU time and, therefore, serves as a best-case baseline for optimizing the turnaround time of short functions. A significant downside of approximating SRPT, however, is that longer functions might be starved. In this paper, we propose a novel application-aware kernel scheduler, ALPS (Adaptive Learning, Priority Scheduler), based on two key insights. First, approximating SRPT can largely benefit short functions but may inevitably penalize long functions. Second, CFS provides necessary infrastructure support to implement user-defined priority scheduling. To this end, we design ALPS to have a novel, decoupled scheduler frontend and backend architecture, which unifies approximate SRPT and proportional-share scheduling. ALPS’ frontend sits in the user space and approximates SRPT-inspired priority scheduling by adaptively learning from an SRPT simulation on a recent past workload. ALPS’ backend uses eBPF functions hooked to CFS to carry out the continuously learned policies sent from the frontend to inform scheduling decisions in the kernel. This design adds workload intelligence to workload-oblivious OS scheduling while retaining the desirable properties of OS schedulers.
We evaluate ALPS extensively using two production FaaS workloads (Huawei and Azure), and results show that ALPS achieves a reduction of 57.2% in average function execution duration compared to CFS.
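The tension between fair sharing and SRPT described in this abstract can be seen in a small tick-based simulation. The sketch below is not ALPS and not CFS. It assumes a single CPU, integer job durations, and a "least CPU time received so far" rule as a rough stand-in for proportional sharing. The function name `simulate` and the example workload are hypothetical.

```python
from typing import Callable

Job = tuple[int, int]  # (arrival time, duration), both in ticks


def simulate(jobs: list[Job], priority: Callable[[int, int], int]) -> list[int]:
    """Run jobs on one CPU, one tick at a time, and return turnaround times.

    At every tick the ready job with the smallest priority(remaining, served)
    runs; ties go to the job listed first.
    """
    n = len(jobs)
    remaining = [duration for _, duration in jobs]
    served = [0] * n
    finish = [0] * n
    t = done = 0
    while done < n:
        ready = [i for i in range(n) if jobs[i][0] <= t and remaining[i] > 0]
        if ready:
            i = min(ready, key=lambda k: priority(remaining[k], served[k]))
            remaining[i] -= 1
            served[i] += 1
            if remaining[i] == 0:
                finish[i] = t + 1
                done += 1
        t += 1
    return [finish[i] - jobs[i][0] for i in range(n)]


# Hypothetical workload: one long function and three short ones, all at t=0.
jobs = [(0, 10), (0, 1), (0, 1), (0, 2)]

srpt_turnaround = simulate(jobs, lambda remaining, served: remaining)
fair_turnaround = simulate(jobs, lambda remaining, served: served)

print("SRPT:", srpt_turnaround)  # [14, 1, 2, 4]
print("Fair:", fair_turnaround)  # [14, 2, 3, 6]
```

In this toy workload the short jobs finish sooner under SRPT while the long job finishes at the same time under both policies. With a steady stream of short arrivals, the long job would keep being preempted under SRPT, which is the starvation risk the abstract mentions.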
Full Text Available
A Closer Look into IPFS: Accessibility, Content, and Performance

https://doi.org/10.1145/3656015

Shi, Ruizhe; Cheng, Ruizhi; Han, Bo; Cheng, Yue; Chen, Songqing (May 2024, Proceedings of the ACM on Measurement and Analysis of Computing Systems)

The InterPlanetary File System (IPFS) has recently gained considerable attention. While prior research has focused on understanding its performance characterization and application support, it remains unclear: (1) what kind of files/content are stored in IPFS, (2) who is providing these files, (3) whether these files are always accessible, and (4) what affects file access performance. To answer these questions, in this paper, we perform measurement and analysis on over 4 million files associated with CIDs (content IDs) that appeared in publicly available IPFS datasets. Our results reveal the following key findings: (1) Mixed file accessibility: while IPFS is not designed for permanent storage, accessing a non-trivial portion of files, such as those of NFTs and video streams, often requires multiple retrieval attempts, potentially blocking NFT transactions and negatively affecting the user experience. (2) Dominance of NFT (non-fungible token) and video files: about 50% of stored files are NFT-related, followed by a large portion of video files, among which about half are pirated movies and adult content. (3) Centralization of content providers: a small number of peers (top-50), mostly cloud nodes hosted by tech companies, serve a large portion (95%) of files, deviating from IPFS's intended design goal. (4) High variation of downloading throughput and lookup time: large file retrievals experience lower average throughput due to more overhead for resolving file chunk CIDs, and looking up files hosted by non-cloud nodes takes longer. We hope that our findings can offer valuable insights for (1) IPFS application developers to take these characteristics into consideration when building applications on top of IPFS, and (2) IPFS system developers to improve IPFS and similar systems to be developed for Web3.
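A concentration figure such as "top-50 peers serve 95% of files" in finding (3) can be computed from a mapping of files to their providers. The sketch below is an assumption about how such a metric might be defined, not the paper's measurement pipeline: a file counts as served by the top-k set if at least one of its providers is among the k peers hosting the most files. The function name and the toy data are hypothetical.

```python
from collections import Counter


def top_k_share(file_providers: dict[str, set[str]], k: int) -> float:
    """Fraction of files with at least one provider among the k busiest peers."""
    # Count how many files each peer provides.
    counts = Counter(peer for peers in file_providers.values() for peer in peers)
    top = {peer for peer, _ in counts.most_common(k)}
    covered = sum(1 for peers in file_providers.values() if peers & top)
    return covered / len(file_providers)


# Toy data: CID -> set of peers providing it.
providers = {
    "cid1": {"A"},
    "cid2": {"A", "B"},
    "cid3": {"A"},
    "cid4": {"B"},
    "cid5": {"C"},
}

share_top1 = top_k_share(providers, 1)
share_top2 = top_k_share(providers, 2)
print(share_top1, share_top2)  # 0.6 0.8
```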
Full Text Available
Understanding Online Education in Metaverse: Systems and User Experience Perspectives

https://doi.org/10.1109/VR58804.2024.00080

Cheng, Ruizhi; Murat, Erdem; Yu, Lap-Fai; Chen, Songqing; Han, Bo (March 2024, IEEE)

Full Text Available
HardenVR: Harassment Detection in Social Virtual Reality

https://doi.org/10.1109/VR58804.2024.00033

Wang, Na; Zhou, Jin; Li, Jie; Han, Bo; Li, Fei; Chen, Songqing (March 2024, IEEE)

Full Text Available
